C:/Users/Vassilis/Documents/research/Paper Submissions/Conferences/ICASSP-2011-clustering/revision/robust_clustering_revised.dvi
نویسندگان
چکیده
Clustering is a basic task in a variety of machine learning applications. Partitioning a set of input vectors into compact, wellseparated subsets can be severely affected by the presence of modelincompatible inputs called outliers. The present paper develops robust clustering algorithms for jointly partitioning the data and identifying the outliers. The novel approach relies on translating scarcity of outliers to sparsity in a judiciously defined domain, to robustify three widely used clustering schemes: hard K-means, fuzzy K-means, and probabilistic clustering. Cluster centers and assignments are iteratively updated in closed form. The developed outlieraware algorithms are guaranteed to converge, while their computational complexity is of the same order as their outlier-agnostic counterparts. Preliminary simulations validate the analytical claims.
منابع مشابه
Reporting quality of submissions to the National Conferences on Electronic Learning in Medical Education: implications from Iranian research performance
Background: Reporting quality of research on medical education has come under scrutiny in recent years in wake of empirical evidence. Poor reporting quality of published abstracts may distract readers from careful reading of research evidence or in a worst case mislead scientists. Main objective of this study was to evaluate the extent and quality of the submitted abstracts to the 3rd and 4th N...
متن کاملTowards a SIGOPS Policy on Subsequent Publications
The Problem Often, during the course of a research project, papers get published on similar topics, perhaps with overlapping content and contributions. For example, a research group might publish a 5-page workshop paper (e.g. at HotOS), later produce an extended version of this paper for a conference (like SOSP), and then produce a revised version for a journal (like TOCS); another version of t...
متن کاملStoring and Querying Multiversion XML Documents using Durable Node Numbers
Managing multiple versions of XML documents represents an important problem for many traditional applications, such as software configuration control, as well as new ones, such as link permanence of web documents. Research on managing multiversion XML documents seeks to provide efficient and robust techniques for storing, retrieving and querying such documents. In this paper, we present a novel...
متن کاملODBASE 2013 PC Co-Chairs Message
We are happy to present the papers of the 10th International Conference on Ontologies DataBases, and Applications of Semantics, ODBASE 2011, held in Heraklion, Crete (Greece), in October 2011. The ODBASE conference series provides a forum for research on the use of ontologies and data semantics in novel applications, and continues to draw a highly diverse body of researchers and practitioners b...
متن کاملSofterware: Replace SAS Programs with XML Documents to Help People and Computers Be Happier with Each Other
In a novel use of electronic documents to replace labor-intensive programming, a SAS-based integration has been implemented that uses XML to automate the creation, revision, and reuse of the publication-quality statistical tables prominent in regulatory submissions. Table content and style revision is XML document-based, not programming-based. The XML documents that define style and content can...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011